Evaluation of Loop Scheduling Algorithms on Distributed Memory Systems
Abstract
Loops are the largest source of parallelism in many applications. All prior DOALL loop scheduling algorithms, such as Self-Scheduling, Guided Self-Scheduling, Trapezoid Self-Scheduling, and Factoring, try to achieve workload balance by decreasing chunk sizes. Moreover, they have been analyzed only for shared memory platforms. In this work, the prior loop scheduling methods are evaluated on two distributed memory machines using realistic workloads from the NAS Parallel Benchmark suite and the Livermore Loop series. The distributed memory platforms are a 16-node IBM SP2 and a 16-node nCUBE 2. The experimental results show that these decreasing chunk size methods tend to increase the communication time in distributed memory models because they assign more chunks. In view of these results, two new schemes, called Fixed Increase and Variable Increase, are introduced. Contrary to the earlier techniques, these schemes increase the chunk sizes in order to minimize the scheduling overhead by reducing interprocessor communication. The new algorithms can be implemented by parallel compilers and are scalable over large numbers of processors and iterations. Extensive measurements on both machines indicate that the increasing chunk size methods can provide better performance than the existing algorithms for almost all workload patterns.
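The contrast between decreasing and increasing chunk sizes can be illustrated with a minimal sketch. The first function uses the standard Guided Self-Scheduling rule, where each chunk is the ceiling of the remaining iterations divided by the number of processors. The second function, `increasing_chunks`, with its `first` and `step` parameters, is a hypothetical linearly increasing schedule written only for illustration; the abstract does not define Fixed Increase or Variable Increase, so it should not be read as either of those schemes.

```python
import math


def guided_chunks(n_iters, n_procs):
    """Guided Self-Scheduling: each chunk is ceil(remaining / n_procs)."""
    chunks = []
    remaining = n_iters
    while remaining > 0:
        size = math.ceil(remaining / n_procs)
        chunks.append(size)
        remaining -= size
    return chunks


def increasing_chunks(n_iters, first, step):
    """Hypothetical increasing schedule: first, first + step, first + 2*step, ...

    Illustrative assumption only; not the paper's Fixed Increase or
    Variable Increase definition. The last chunk is clipped so that the
    sizes sum to n_iters.
    """
    chunks = []
    remaining = n_iters
    size = first
    while remaining > 0:
        take = min(size, remaining)
        chunks.append(take)
        remaining -= take
        size += step
    return chunks


if __name__ == "__main__":
    gss = guided_chunks(100, 4)
    inc = increasing_chunks(100, first=4, step=4)
    print("guided:    ", gss, "chunks:", len(gss))
    print("increasing:", inc, "chunks:", len(inc))
```

Each chunk corresponds to one scheduling request, so on a distributed memory machine the number of chunks is a rough proxy for the number of scheduling messages. Whether an increasing schedule actually produces fewer chunks depends on the chosen `first` and `step` values.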
Similar resources
An Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ
Efficient assignment and scheduling of tasks is one of the key elements in the effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard, which is why we used meta-heuristic methods to find a suboptimal schedule. In this paper we propose a new approach using TRIZ (especially its 40 inventive principles). The basic idea of thi...
Pre-scheduling and Scheduling of Task Graph on Homogeneous Multiprocessor Systems
Task graph scheduling is a multi-objective, NP-hard optimization problem. In this paper a new algorithm for homogeneous multiprocessor systems is proposed. Basically, scheduling algorithms aim to balance two parameters: time and energy consumption. Up to a certain limit, these two parameters conflict with each other, and improvement of one causes reduction in the othe...
A Scalable Scheduling Algorithm for Real-Time Distributed Systems
Scheduling real-time tasks in a distributed-memory multiprocessor is characterized as sequencing a set of tasks and assigning them to processors of the architecture. Real-time systems research has extensively investigated the sequencing dimension of the scheduling problem by extending uni-processor scheduling techniques to more complex architectures. We introduce a technique that uses an assignm...
Development of a Set of Algorithms for the Multi-Project Scheduling Problems
In this paper, the problem of determining the best schedule for a set of projects has been modeled in the form of a generalized tardiness flowshop (GTF) problem. We develop a set of heuristic algorithms for minimizing the total tardiness of jobs in a GTF problem. In the generalized version of tardiness flowshop problems, a job is considered to be a collection of operations and there is a due da...